PATENT 

Serial No. 10/522,468 

Amendment in Reply to Office Action mailed on December 11, 2006 

IN THE SPECIFICATION 
Please amend the specification as follows: 

Replace the paragraph on page 2, between lines 2 7-28 of the 
specification with the following: 

It ic an object of the It is an object of the invention to 
provide a method and system for automatically determining a reading 
of a Japanese word. 

Replace the paragraph spanning pages 3-4, between page 3, line 
23, and page 4, line 4 of the specification with the following: 

A s described in the dependent claim 2, for For_a kan j i 
character that in the word is not immediately preceded or succeeded 
by a kanji character, the method includes choosing a most frequent 
one of a plurality of kun-readings associated with the kanji 
character. Some kanji characters may be associated with several 
different kun-readings. The most frequently occurring one is 
selected. The several options may all be stored in a memory, 
possibly with their relative frequency of occurrence (or sorted on 
frequency) . In this way, the method may, optionally, enable a user 
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to select a different reading. If this is not required, the method 
may include storing the most frequent kun-reading of each kanji 
character in a memory for use during the conversion of a Japanese 
word in a textual form to an acoustical form. Similarly, as 
described in the dependent claim 3, for a kanji character that in 
the word is immediately preceded or succeeded by at least one kanji 
character, the method includes choosing a most frequent one of a 
plurality of on- readings associated with the kanji character. 

Replace the paragraph on page 4, between lines 5-9 of the 
specification with the following: 

Ac dcocrib e d in - 1 ^^0^^ oTnbod-i mant of the dependent claim 
d, t h e The m ost frequent on-reading 4s -may be selected by also 
considering the neighboring kanji character (s) . For the group of 
two or more kanji characters the most frequent on- reading is chosen 
and applied to the characters of the group. In this way, the 
quality can be improved further than when the decision is made 
solely based on the frequency of reading of isolated characters. 

Replace the paragraph on page 4, between lines 10-11 of the 
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specification with the following: 

A n described in the dependent claim 5, each Each hiragana 
character 4s- may be associated with one reading and for a hiragana 
character of the word the associated reading is chosen. 

Replace the paragraph on page 4, between lines 12-14 of the 
specification with the following: 

As described in the dependent claim 6, each Each katakana 
character 4s -may be associated with a corresponding hiragana 
character; and for a katakana character of the word choosing the 
reading associated with the hiragana character corresponding to the 
katakana character. 

Replace the paragraph spanning pages 10-11, between page 10, 
line 3, and page 11, line 5 of the specification with the 
following: 

Fig. 4 shows a further table for use by the method. The table 
has a separate row for each kan j i character supported by the 
system. Preferably, all kan j i characters are supported, which are 
about-g-O-OQ 60 0 0 different characters. If so desired, the number of 
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supported kanji characters may be limited, for example to the 500 
or 1000 most used characters. A suitable subset is the "Joyo kanji" 
list, an official listing of 1,945 kanji characters published in 
1981 by the Japanese Ministry of Education. The list comprises all 
the kanji one might expect to encounter in "everyday use" - on 
signs, in newspapers and so on. In the exemplary table, column 410 
identifies the kanji character, for example in a digital form, 
using a two-byte representation. Any suitable sequence may be used. 
In column 42 0, a corresponding acoustic transcription is stored for 
each of the kanji characters in the form of a representation of the 
most frequent on- reading of the character, for example using a 
phonetic or other suitable representation. In column 430, a further 
corresponding acoustic transcription is stored for each of the 
kanji characters in the form of a representation of the most 
frequent kun-reading of the character. It is well-known to create 
acoustic representations of the different classes of reading of 
kanji characters and the choices within each class are well-known 
and this will not be described in more detail here. It will be 
appreciated that instead of one acoustic representation stored in 
each of the columns 420 and 43 0, the system may include several 



TW02 0011-amd-02-2S-07.doc 



6 



PATENT 

Serial No. 10/522,468 

Amendment in Reply to Office Action mailed on December 11, 2006 

acoustic representations for each of the readings, where each sub- 
column with a different representation corresponds to a regional 
variation in pronunciation (also referred to as accent) . The table 
shown in Fig. 4, in principle, enables finding an acoustic 
transcription for individual kanj i characters. Below, more details 
will be given on determining a preferred acoustic transcription. 
Since hiragana (and also katakana) can be used as an acoustic 
representation ("reading") of a kanji character, columns 420 and 
430 may also include one or more hiragana characters that represent 
the acoustic transcription (or if so desired, the columns may 
include katakana characters) . In this way, the table can also be 
used for converting individual kanji characters to hiragana (and/or 
katakana) characters. As such, the combination of tables shown in 
Figs 3 and 4, enable transcription of Japanese text corpora to 
hiragana (and/or katakana) . For applications, like speech 
recognition and speech synthesis, it is usually preferred to also 
have access to an acoustic representation other than hiragana or 
katakana) . For this purpose, column 330 of Fig. 3 and columns 420 
and 430 of Fig. 4 can be used. If the purpose is solely to perform 
a transcription of Japanese text corpora, column 330 is not 
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required. Instead in columns 42 0 and 43 0 the hiragana (or katakana) 
transcription can be given as the on- reading and kun- reading, 
respectively. 
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